I am an assistant professor in the School of Operations Research and Information Engineering and Cornell Tech at Cornell University. I am a field member of ORIE, Center for Applied Mathematics (CAM), Statistics, ECE, and CS. I am affiliated with WCM Institute of AI for Digital Health. Prior to Cornell Tech, I was a postdoctoral fellow at the Statistical Reinforcement Lab at Harvard University. I received my Ph.D. in Operations Research in 2022 from Carnegie Mellon University. I received my B.A.s in Mathematics and Economics from Smith College in 2017.

My lab aims to develop a robust, end-to-end framework for generating and implementing evidence-based personalized healthcare strategies that are statistically sound and practically feasible in real-world clinical settings. Our work is fundamentally motivated by the unique challenges of healthcare applications, where high-stakes decision-making, limited sample sizes, and the need for minimal yet defensible statistical assumptions demand methods that are not only rigorous but also efficient and interpretable.

The pipeline begins with leveraging observational data to generate initial evidence and hypotheses using scalable causal discovery methods that are efficient under finite-samples and robust under unobserved variables. This involves designing novel causal discovery algorithms, improving the efficiency of nonparametric conditional indepndent tests, and leverge modern machine learning methods such as diffusion to handle unobserved variables.
To validate these findings, we focuses on the effcient design of adaptive clinical trials (such as micro-randomized trials (MRTs) and N-of-1 trials) under practical constraints, which provide rigorous, dynamic, and individualized causal evidence. Our methodology involves two distinct strands: (1) designing approximation algorithms for constrained, simpler problems, and (2) reformulating restless bandit problems to develop algorithms with stronger finite-sample guarantees.
Further, we are interested in developing efficient and robust statistical inference methods in varies settings to ensure that results are statistically sound and reproducible. This includes developing theory and tools that are valid for non-adaptively collected data, data generated through adaptive algorithms (e.g., bandits or sequential trials), and decentralized (federated) data environments.

We are broadly interested in translating research into practice by collaborating directly with clinicians to tackle pressing challenges: improving heart failure care, creating interpretable clinical LLMs, and leveraging diffusion models for single-cell genomics. A copy of my CV can be found here.



Funding and Awards

My research is supported by
  • AWS Credit Grants – Cornell’s Center for Data Science for Enterprise and Society

  • My work has been recognized by the following awards:
  • Finalist, 2023 INFORMS DMDA Workshop Best Paper Competition -- Theoretical Track
  • Winner, 2021 INFORMS Pierskalla Best Paper Award
  • Winner, 2021 CHOW Best Student Paper in the Category of Operations Research and Management Science
  • Finalist, 2019 INFORMS IBM Service Science Best Student Paper Award
  • Tata Consultancy Services Fellowship, Tepper School of Business, CMU, 2020
  • Ann Kirsten Pokora Prize, Department of Mathematics, Smith College, 2017
  • Group

    PhD Advisees:
  • Sujai Hiremath (ORIE)
  • Diyang Li (CS)
  • Wenxin Chen (CS, coadvised with Fei Wang)
  • Dominik Meier (CS, coadvised with Raaz Dwivedi)
  • Jiamin Xu (ORIE)
  • Shixing Yu (ECE)
  • Mengxiao Gao (ORIE)
  • Jacqueline Maasch (CS, Minor advisor)

  • Current and Past Student Collaborators:
  • Xueqing Liu (PhD student at Duke-NUS Medical School -> Postdoc at Harvard, Fall 2025)
  • Mengxiao Gao (Undergrad student at Tsinghua University -> Cornell ORIE PhD program, Fall 2025)
  • Flavia Jiang (Undergrad student at Cornell -> University of Chicago Data Science PhD program, Fall 2025)
  • Ruiyang Lin (Undergrad student at the University of Science and Technology of China -> Washington University in St. Louis Data Science PhD program, Fall 2025)
  • Austin Zhao (Undergrad student at Cornell)
  • Qingshan Xu (Undergrad student at University of Electronic Science and Technology of China)
  • Methodological Work

    • When Additive Noise Meets Unobserved Mediators: Bivariate Denoising Diffusion for Causal Discovery with Dominik Meier, Sujai Hiremath, Promit Ghosal (arXiv)

    • Optimal Adjustment Sets for Nonparametric Estimation of Weighted Controlled Direct Effect with Ruiyang Lin, Yongyi Guo (arXiv)

    • MOSIC: Model-Agnostic Optimal Subgroup Identification with Multi-Constraint for Improved Reliability with Wenxin Chen, Weishen Pan, Fei Wang (arXiv)

    • From Restless to Contextual: A Thresholding Bandit Approach to Improve Finite-Horizon Performance with Jiamin Xu, Ivan Nazarov, Aditya Rastogi, Africa Perianez (arXiv)

    • Online Uniform Sampling: Randomized Learning-Augmented Approximation Algorithms with Application to Digital Health with Xueqing Liu, Esmaeil Keyvanshokooh, and Susan A. Murphy (arXiv)

    • LoSAM: Local Search in Additive Noise Models with Mixed Mechanisms and General Noise for Global Causal Discovery with Sujai Hiremath, Promit Ghosal (arXiv) (UAI 2025)

    • Reward Maximization for Pure Exploration: Minimax Optimal Good Arm Identification for Nonparametric Multi-Armed Bandits with Brian Cho, Dominik Meier, Nathan Kallus (arXiv) (AISTATS 2025)

    • CSPI-MT: Calibrated Safe Policy Improvement with Multiple Testing for Threshold Policies with Brian Cho, Ana-Roxana Pop, Sam Corbett-Davies, Israel Nir, Ariel Evnine, Nathan Kallus (arXiv) (KDD 2025)

    • Local Causal Discovery for Structural Evidence of Direct Discrimination with Jacqueline Maasch, Violet Chen, Agni Orfanoudaki, Nil-Jana Akpinar, Fei Wang (arXiv) (AAAI 2025)

    • Hybrid Top-Down Global Causal Discovery with Local Search for Linear and Nonlinear Additive Noise Models with Sujai Hiremath, Jaqueline Maasch, Mengxiao Gao, Promit Ghosal (arXiv) (NeurIPS 2024)

    • Peeking with PEAK: Sequential, Nonparametric Composite Hypothesis Tests for Means of Multiple Data Streams with Brian Cho and Nathan Kallus (arXiv)(ICML 2024)

    • Kernel Debiased Plug-in Estimation: Simultaneous, Automated Debiasing without Influence Functions for Many Target Parameters with Brian Cho, Yaroslav Mukhin, and Ivana Malenica (arXiv)(ICML 2024)
    • Finalist, 2023 INFORMS DMDA Workshop Best Paper Competition -- Theoretical Track

    • Local Discovery by Partitioning: Polynomial-Time Causal Discovery Around Exposure-Outcome Pairs with Jacqueline Maasch, Weishen Pan, Shantanu Gupta, Volodymyr Kuleshov, Fei Wang (arXiv) (UAI 2024) (NeurIPS Causal Representation Learning Workshop 2023)

    • Improving Treatment Responses through Limited Nudges: A Data-Driven Learning and Optimization Approach with Esmaeil Keyvanshokooh, Yongyi Guo, Xueqing Liu, Anna L. Trella, and Susan A. Murphy (Journal version)
    • Contextual Bandits with Budgeted Information Reveal with Esmaeil Keyvanshokooh, Xueqing Liu, and Susan A. Murphy (arXiv) (AISTATS 2024)

    • Toward a Liquid Biopsy: Greedy Approximation Algorithms for Active Sequential Hypothesis Testing with Su Jia, Andrew Li, and Sridhar Tayur (SSRN) (Management Science, 2025)
    • Greedy Approximation Algorithms for Active Sequential Hypothesis Testing with Su Jia, and Andrew Li (NeurIPS 2021)
    • Winner, 2021 INFORMS Pierskalla Best Paper Award

    • Causal Inference with Selectively Deconfounded Data with Andrew Li, Zachary Lipton, and Sridhar Tayur (AISTATS 2021) (Journal version) (NeurIPS CausalML Workshop 2019)

    • Optimizing Wearable Devices in Personalized Opioid Use Disorder Treatments Under Budget Constraint with Yanhan (Savannah) Tang, Alan Scheller-Wolf and Sridhar Tayur (Journal version)
    • CHOW best paper in the category of operations research/management science, 2021
    • Finalist, 2019 INFORMS IBM Service Science Best Student Paper Award

    Review, Workshop Publication, and Applied Work

    Teaching

    • Fall 2023, 2024, 2025, CS 5785/ORIE 5750/ECE 5414, Applied Machine Learning
    • Spring 2025, ORIE 5217: Digital N-of-1 Trials and Their Application
    • Spring 2024, ORIE 7790: Selected topics in Applied Statistics - Statistical and Optimization Methods for Decision-Making in Healthcare

    Selected Talks

  • 8th Rotman Annual Research Roundtable: Data Analytics in Healthcare, Local Causal Discovery for Structural Evidence of Direct Discrimination for Liver Transplant, Toronto, May 23, 2025
  • Cornell AI Seminar, Efficient Local and Global Causal Discovery: Methods Leveraging Causal Substructures for Improved Finite Sample Performance, Virtual, Feb 14, 2025
  • Workshop on Individualized Decision Making, panelist on implementation challenges of individualized decisions, Berkeley, July 18-19, 2024
  • Conference on Health, Inference, and Learning (CHIL), moderator, the panel on behavioral health and economics, New York, June 27-28, 2024
  • pre-ENAR workshop on Statistical Methods for Digital Health Technologies Data, Online Uniform Risk Times Sampling, Baltimore, March, 9, 2024
  • Learning on Graphs New York, A gentle introduction to causal discovery and local causal discovery, Jersey City, March 1, 2024
  • ITA Workshop, Online Uniform Risk Times Sampling, San Diego, Feb 20, 2024
  • Cornell Center for Applied Mathematics Colloquium, Kernel Debiased Plug-in Estimation , Ithaca, Feb 9, 2024
  • IMSI workshop on Machine Learning and Artificial Intelligence for Personalized Medicine, Budgeted Information Reveal in Sequential Experiments, Chicago, April 17, 2023
  • Harvard Statistics Colloquium, Greedy Approximation Algorithms for Active Sequential Hypothesis Testing, Cambridge, October 31, 2022
  • Workshop on Quantifying Uncertainty: Stochastic, Adversarial, and Beyond, Simons Institute for the Theory of Computing, Greedy Approximation Algorithms for Active Sequential Hypothesis Testing, Berkeley, September 12, 2022